Image rectification

Image rectification is a transformation process used to project two-or-more images onto a common image plane. It corrects image distortion by transforming the image into a standard coordinate system.

It is used in computer stereo vision to simplify the problem of finding matching points between images.
It is used in geographic information systems to merge images taken from multiple perspectives into a common map coordinate system.

1 Computer stereo vision
- 1.1 Transformation
- 1.2 Algorithms
2 Geographic information system
3 See also
4 References

Computer stereo vision

Stereo vision uses triangulation based on epipolar geometry to determine distance to an object.

Between two cameras there is a problem of finding a corresponding point viewed by one camera in the image of the other camera (known as the correspondence problem). In most camera configurations, finding correspondences requires a search in two-dimensions. However, if the two cameras are aligned to be coplanar, the search is simplified to one dimension - a horizontal line parallel to the baseline between the cameras. Furthermore, if the location of a point in the left image is known, it can be searched for in the right image by searching left of this location along the line, and vice versa (see binocular disparity). Image rectification is an equivalent (and more often used^[1]) alternative to perfect camera alignment. Image rectification is usually performed regardless of camera precision due to

impracticality or impossibility of perfectly aligning cameras
perfectly aligned cameras may become misaligned over time

Transformation

If the images to be rectified are taken from camera pairs without geometric distortion, this calculation can easily be made with a linear transformation. X & Y rotation puts the images on the same plane, scaling makes the image frames be the same size and Z rotation & skew adjustments make the image pixel rows directly line up. The rigid alignment of the cameras needs to be known (by calibration) and the calibration coefficients are used by the transform^[2].

In performing the transform, if the cameras themselves are calibrated for internal parameters, an essential matrix provides the relationship between the cameras. The more general case (without camera calibration) is represented by the fundamental matrix. If the fundamental matrix is not known, it is necessary to find preliminary point correspondences between stereo images to facilitate its extraction^[2].

Stereo images can also be taken with a single camera in motion. In this case the relationship of the images can have significant forward-motion components, and a linear transformation may produce severely warped images or very large images. Non-linear transformation techniques can be used to manage this difficulty^[3]^[1]^[4].

Algorithms

There are basically three algorithms for image rectification: planar rectification ^[5], cylindrical rectification^[1] and polar rectification^[3]^[4]^[6].

Geographic information system

Image rectification in GIS converts images to a standard map coordinate system. This is done by matching ground control points (GCP) in the mapping system to points in the image. These GCPs calculate necessary image transforms^[7].

Primary difficulties in the process occur

when the accuracy of the map points are not well known
when the images lack clearly identifiable points to correspond to the maps.

The maps that are used with rectified images are non-topographical. However, the images to be used may contain distortion from terrain. Image orthorectification additionally removes these effects^[7].

Image rectification is a standard feature available with commercial GIS software packages.

References

^ ^a ^b ^c Oram, Daniel (2001). "Rectification for Any Epipolar Geometry". http://www.bmva.org/bmvc/2001/papers/82/accepted_82.pdf. Retrieved 2010-06-08.
^ ^a ^b Fusiello, Andrea (2000-03-17). "Epipolar Rectification". http://profs.sci.univr.it/~fusiello/rectif_cvol/rectif_cvol.html. Retrieved 2008-06-09.
^ ^a ^b Pollefeys, Marc; Koch, Reinhard; Van Gool, Luc (1999). "A simple and efficient rectification method for general motion". Proc. International Conference on Computer Vision: 496–501. http://www.inf.ethz.ch/personal/pomarc/pubs/PollefeysICCV99.pdf. Retrieved 2011-01-019.
^ ^a ^b Lim, Ser-Nam; Mittal, Anurag; Davis, Larry; Paragios, Nikos. "Uncalibrated stereo rectification for automatic 3D surveillance". International Conference on Image Processing 2: 1357. http://www.umiacs.umd.edu/users/sernam/papers/rect.pdf. Retrieved 2010-06-08.
^ Fusiello, Andrea; Trucco, Emanuele; Verri, Alessandro (2000-03-02). "A compact algorithm for rectification of stereo pairs". Machine Vision and Applications (Springer-Verlag) 12: 16–22. doi:10.1007/s001380050120. http://profs.sci.univr.it/~fusiello/papers/00120016.pdf. Retrieved 2010-06-08.
^ Roberto, Rafael; Teichrieb, Veronica; Kelner, Judith (2009). "Retificação Cilíndrica: um método eficente para retificar um par de imagens" (in portuguese). Workshops of Sibgrapi 2009 - Undergraduate Works. http://www.matmidia.mat.puc-rio.br/sibgrapi2009/media/undergraduate_work/60067.pdf. Retrieved 2011-03-05.
^ ^a ^b Fogel, David. "Image Rectification with Radial Basis Functions". http://www.ncgia.ucsb.edu/conf/SANTA_FE_CD-ROM/sf_papers/fogel_david/santafe.html. Retrieved 2008-06-09.

R. I. Hartley (1999). "Theory and Practice of Projective Rectification". Int. Journal of Computer Vision 35: 115–127. doi:10.1023/A:1008115206617.
Pollefeys, Marc. "Polar rectification". http://www.cs.unc.edu/~marc/tutorial/node99.html. Retrieved 2007-06-09.
Linda G. Shapiro and George C. Stockman (2001). Computer Vision. Prentice Hall. pp. 580. ISBN 0-13-030796-3.